A Method for Closed Frequent Subgraph Mining in a Single Large Graph

نویسندگان

چکیده

Mining frequent subgraphs is an interesting and important problem in the graph mining field, that from a single large has been strongly developed, recently attracted many researchers. Among them, MNI-based approaches are considered as state-of-the-art, such GraMi algorithm. Besides subgraph (FSM), closed was also developed. This practical applications fundamental premise for studies. paper proposes CloGraMi (Closed Frequent Subgraph Mining) algorithm based on to find all graph. Two effective strategies first one new level order traversal strategy quickly determine searching process, second setting condition early pruning portion of non-closed candidates, both them aim reduce running time well memory requirements, improve performance proposed Our experiments performed five real datasets (both directed undirected graphs) results show requirements our significantly better than those GraMi-based

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Parallel Approach for Frequent Subgraph Mining in a Single Large Graph Using Spark

Frequent subgraph mining (FSM) plays an important role in graph mining, attracting a great deal of attention in many areas, such as bioinformatics, web data mining and social networks. In this paper, we propose SSIGRAM (Spark based Single Graph Mining), a Spark based parallel frequent subgraph mining algorithm in a single large graph. Aiming to approach the two computational challenges of FSM, ...

متن کامل

GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph

Mining frequent subgraphs is an important operation on graphs; it is defined as finding all subgraphs that appear frequently in a database according to a given frequency threshold. Most existing work assumes a database of many small graphs, but modern applications, such as social networks, citation graphs, or proteinprotein interactions in bioinformatics, are modeled as a single large graph. In...

متن کامل

Support Computation for Mining Frequent Subgraphs in a Single Graph

Defining the support (or frequency) of a subgraph is trivial when a database of graphs is given: it is simply the number of graphs in the database that contain the subgraph. However, if the input is one large graph, it is surprisingly difficult to find an appropriate support definition. In this paper we study the core problem, namely overlapping embeddings of the subgraph, in detail and suggest...

متن کامل

A hybrid method for frequent closed patterns mining in large-scale high dimensional biological data

Large high dimensional biological data have posed great challenges to most existing algorithms for frequent patterns mining. In this paper, we propose a hybrid method to find all frequent patterns: we firstly decompose the mining task into two subproblems (discovering long frequent patterns and short frequent patterns), then we choose different algorithms to solve the subproblems respectively. ...

متن کامل

Frequent Subgraph Mining from Streams of Linked Graph Structured Data

Nowadays, high volumes of high-value data (e.g., semantic web data) can be generated and published at a high velocity. A collection of these data can be viewed as a big, interlinked, dynamic graph structure of linked resources. Embedded in them are implicit, previously unknown, and potentially useful knowledge. Hence, efficient knowledge discovery algorithms for mining frequent subgraphs from t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3133666